Applied Text Analysis with Python: Enabling Language-Aware Data Products with Machine Learning

Applied Text Analysis with Python: Enabling Language-Aware Data Products with Machine Learning

作者: Benjamin Bengfort Rebecca Bilbro Tony Ojeda
出版社: O'Reilly
出版在: 2018-07-01
ISBN-13: 9781491963043
ISBN-10: 1491963042
裝訂格式: Paperback
總頁數: 332 頁





內容描述


From news and speeches to informal chatter on social media, natural language is one of the richest and most underutilized sources of data. Not only does it come in a constant stream, always changing and adapting in context; it also contains information that is not conveyed by traditional data sources. The key to unlocking natural language is through the creative application of text analytics. This practical book presents a data scientist’s approach to building language-aware products with applied machine learning.
You’ll learn robust, repeatable, and scalable techniques for text analysis with Python, including contextual and linguistic feature engineering, vectorization, classification, topic modeling, entity resolution, graph analysis, and visual steering. By the end of the book, you’ll be equipped with practical methods to solve any number of complex real-world problems.

Preprocess and vectorize text into high-dimensional feature representations
Perform document classification and topic modeling
Steer the model selection process with visual diagnostics
Extract key phrases, named entities, and graph structures to reason about data in text
Build a dialog framework to enable chatbots and language-driven interaction
Use Spark to scale processing power and neural networks to scale model complexity




相關書籍

Pandas for Everyone: Python Data Analysis (Addison-Wesley Data & Analytics Series)

作者 Daniel Y. Chen

2018-07-01

精通資料分析|使用 Excel、Python 和 R (Advancing Into Analytics: From Excel to Python and R)

作者 George Mount 沈佩誼 譯

2018-07-01

Python數據分析從入門到精通

作者 李梓萌

2018-07-01